Spotting Overgeneration Suspects
نویسندگان
چکیده
We present a method for quickly spotting overgeneration suspects (i.e., likely cause of overgeneration) in hand-coded grammars. The method is applied to a medium size Tree Adjoining Grammar (TAG) for French and is shown to help reduce the number of outputs by 70% almost all of it being overgeneration.
منابع مشابه
The typological consequences of weighted constraints
A common ‘typological criterion’ on linguistic models is that they should predict (almost) all observed patterns while minimizing overgeneration. For optimizationbased models, it has been argued that constraints should be ranked rather than weighted to minimize overgeneration. Recently, however, weighting has been shown to elegantly capture patterns that ranking misses. To evaluate the issue, w...
متن کاملOvergeneration and Ranking for Spoken Dialogue Systems
We describe an implemented generator for a spoken dialogue system that follows the ‘overgeneration and ranking’ approach. We find that overgeneration based on bottom-up chart generation is wellsuited to a) model phenomena such as alignment and variation in dialogue, and b) address robustness issues in the face of imperfect generation input. We report evaluation results of a first user study inv...
متن کاملPrinciple-Based Parsing without Overgeneration
Overgeneration is the main source of computational complexity in previous principle-based parsers. This paper presents a message passing algorithm for principle-based parsing that avoids the overgenera-tion problem. This algorithm has been implemented in C++ and successfully tested with example sentences from (van Riemsdijk and Williams, 1986).
متن کاملPhonological Overgeneration in Paninian System
In this paper an attempt is made to study the problem of overgeneration that is caused by the application of the system of P ān. ini The system of P ān. ini is made up of certain rules stated by him and his commentators namely, Kātyāyana and Patañjali. These rules are supposed to produce the forms that are used in the language, i.e. Sanskrit. However, sometimes the technical application of thes...
متن کاملIn Proceedings of ACL - 93 PRINCIPLE - BASED PARSING WITHOUT OVERGENERATION 1
Overgeneration is the main source of computational complexity in previous principle-based parsers. This paper presents a message passing algorithm for principle-based parsing that avoids the overgenera-tion problem. This algorithm has been implemented in C++ and successfully tested with example sentences from (van Riemsdijk and Williams, 1986).
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007